List decoding of repeated codes 



Fernando Hernando*, Michael O' Sullivan^ and Diego Ruano* 
*Department of Mathematics, Universidad Jaume I, Spain 
Email: carrillf@mat.uji.es 
"''Department of Mathematics and Statistics, San Diego State University, USA 

Email: mosulliv@math.sdsu.edu 
^Department of Mathematical Sciences, Aalborg University, Denmark 
Email: diego@math.aau.dk 

p.^ I Abstract 

(<~^ . Assuming that we have a soft-decision list decoding algorithm of a linear code, a new hard-decision list 

psj ■ decoding algorithm of its repeated code is proposed in this article. Although repeated codes are not a clever way 

of encoding data, due to their parameters, we show that they have a good performance with this algorithm. We 
■ compare, by computer simulations, this algorithm for the repeated code of a Reed-Solomon code against a list 

pLn . decoding algorithm of a Reed-Solomon code. Finally, we estimate the decoding capability of the algorithm for 

Reed-Solomon codes. 
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> 



List decoding was introduced by Elias flU and Wozencraft [[T2|. A list decoder can produce several 
candidate codewords near the received vector, thus relaxing the requirements of unique decoding and 



00 I enabling the possibility of decoding beyond half of the minimum distance. The efficient list decoding 
^ ■ problem was unsolved for many years until Sudan ifTTIl provided an algorithm for low rate Reed-Solomon 
^ ■ codes. Later Guruswami and Sudan [|5]| gave a general answer for Reed-Solomon codes that has 2 steps: 

' the interpolation step and the root finding step. 
Q ; A soft-decoding algorithm works under the assumption that the output of the channel is probabilistic 
^ ■ information for the reliability of each input data. Koetter and Vardi considered a soft-decision list decoding 
. . . algorithm for RS codes based on the interpolation techniques of Guruswami-Sudan [6|. The probabilistic 
. ^ I information is translated into a multiplicity matrix, which is considered to be the input of the soft- 
^ ; decision decoding algorithm. Koetter and Vardi considered a soft-decision list decoding algorithm for 
■ RS codes based on the interpolation techniques of Guruswami-Sudan [6J. This algorithm has 3 steps, 
first the multiplicity assignment step where a set of points with multiplicities is defined using reliability 
information provided by the channel. The other two steps are the interpolation and root finding steps. 

In this paper we consider the repeated code of a linear code C c ¥'^, that is = {(c, . . . , c) : c 6 C} c F^", 
for some £ >2. We present a (hard-decision) list decoding algorithm for repeated codes, based on a soft- 
decision list decoding algorithm for the constituent code. In our algorithm, the multiplicity step is based 
on the algebraic structure of the code, instead of on the information from the channel. Namely, it is 
an interpolation problem for a single block taking into account the information provided by the whole 
codeword. As far as the authors known, this is a novel idea in interpolation decoding. 

Repeated codes do not have good parameters but the minimum distance of a linear code is just a rough 
estimate of the list-decoding performance of the code. We show that repeated codes of Reed-Solomon 
codes using the algorithm in [[8] and taking into account our new set up for the multiplicities, may have 
similar decoding capability to a Reed-Solomon code, with the same information ratio and length, using 
the algorithms in |[T1, [Q. However, significant difi'erences in the computation time are observed since the 
complexity of the first simulation depends on n instead of in £n. 



Finally, we estimate the decoding capability of the algorithm for Reed-Solomon codes. Even though 
the bounds assume certain properties of the error vector, they are sharp in the examples. 

The paper is organized as follows: In Section |Il] we recall the soft-decision interpolation problem for 
RS codes. In Section [ni] we introduce our list decoding algorithm for repeated codes. In section JVl 
we present some simulations of the decoding algorithm with MAGMA [2] and interpret the results. We 
estimate the number of errors t that we can decode in section |Vl Section |VI] finishes the article. 

II. Soft-Decoding 

We recall the soft-decoding algorithm for Reed-Solomon codes in |[8l, which will be used for our 
computer simulations. Let ai, . . . ,a„he n different points of the finite field with q elements and C be 
the Reed-Solomon code with parameters [n, k, d] defined as 

C = mai), h{a2), Kan)) : deg(/z) <k-\}, 

where a is a primitive element in F^. 

Consider the {l,k - l)-weighted degree of a monomial x'y, degi,^i(x'y^) = i + j(k - 1). Given two 
monomials x^'y'^ and x'y\ we define the monomial order >k-i over the polynomial ring F£^[x,j]: we say 
that x"y'^ >k^i x'yj if either deg^_i(x"y^) > deg^^i(x'y) or if degf._i(xf^y'') = deg^^i(x'y) and b > j. 

The interpolating polynomial problem consists of finding the minimal polynomial Q(x,y) e F^^[x,3;] 
with respect to the monomial order >^_i such that it has a zero of multiplicity at least at the point 
Pi,^ = (o:i,/3) for ?' = \, . . .,n and /3 e F^, where the multiplicities m,;^ are given, usually from reliability 
information provided by the channel [l6i. 

Consider M = {(pijs, mijs) : i = I, . . . ,n;/3 eV^} and the ideal in ¥q[x, y] of polynomials interpolating at 
the points of M with the desired multiplicities: 

lM = {f& V^[x,y] : multp(/) > m for {p,m) £ M}. 

For each r = (ri, . . . , r„) e F[] we define its score as: 



score(r) = ^ m,-,... 



i=i 



We denote by hr the interpolating polynomial at the points (a,, r,) for z = 1, . . . , n. If score(c) > deg^_j(/) 
then y - he is a root of / as a polynomial in y. 

The requirement that Q{x,y) pass through (ai,/3) with multiplicity m,^ imposes ('"'j^^) conditions, so 
overall we have 

conditions. Hence, we look for an interpolating polynomial of the form 2(i,7)es fij^'y^^ where S c {(z, j) : 
z, j > 0} is a subset of indices with |5 1 = A'^ where 

;=1 /?eF, ^ 

One method for obtaining Q(x,y), for example fS], is to compute a Grobner basis of Im with respect 
to the {l,k - l)-weighted degree and pick the smallest element in it. There is an upper bound for the 
(l,k- l)-weighted degree of a polynomial Q(x,y) given that N conditions are imposed (see (81 Proposition 
3] which thereby bounds the y-degree of Qix,y). 



III. List decoding of repeated codes 

Let C c F^^ be a Reed-Solomon code with parameters [n, k, d] and generator matrix G. We will describe 
the algorithm for a Reed-Solomon code, however, the algorithm can be extended in a straightforward 
manner to any linear code provided with soft-decision list-decoding algorithm. 

We consider the repeated code of C, 

C' = {(c,...,c):ceC}, 

which has parameters [£n,k,£d] and generator matrix (G| ■ ■ ■ |G) IfTOl Problem 17 of Ch. 1]. We will 
describe a hard-decision list decoding algorithm for by using a soft-decoding algorithm for C, thus 
we only have to define the matrix of multiplicities from a received word. 
Let c = (ci, . . . , c„) e C then a typical codeword of is of the form 

c = (ci, . . . , c„, . . . , Ci, . . . , c„) e F^". 

One can also understand a vector of length £n as an £ x n matrix. Hence, if c e M(£ x n, F^) we denote 
by dj the entry corresponding to the j-th row (block) and the i-th column (position). According to this 
notation a word in F^" may be represented as 

C — (Cj , . . . , C^, . . . , Cj , . . . , Cjj). 

Let c be the sent word and r = c -I- e the received word with error weight r = wt(e). We have that 
r = (r}, . . . , r^, . . . , Tj, . . . , and c = (c\, . . . ,cl, . . . ,c[, . . . , c,^). Since c e C", one has that 

cj = c], for every z e { 1 , . . . , n} and j,k e {I, . . . ,£}. 

Therefore, r^. = r'f if and only if ej = e'^. Hence if the received word has the same value in several 
positions it is likely that the error value in these positions is zero. Namely, the more positions where the 
values agree the more likely that these positions are error free. Moreover, the bigger the base field F^ is 
more likely is. Based on this fact we will define the multiplicities m,y} at the point p,^} = (Qr;,/3) for the 
soft-decoding algorithm of C. 
For a received word r, we define 

mi,f, = \{je{\,...,i}:ri=/3}\, (2) 

for i = 1, . . .,n and JS eF^. 

Finally, we will consider the soft-decision algorithm for C (for instance dH) with these multiplicities. 
Trivially, we have that the computational complexity of this algorithm for equals the computational 
complexity of the soft-decoding algorithm for C. For instance it is 0{n^mL^) using f8], where L is the 
list size and m = max{mjjj : j = I, . . .,n and /3 e F^^}, the multiplicity. 

Let us investigate the score of a block of a codeword, that is a codeword in C. 

Lemma 1: Let r = c -i- e be a received word and c = (c, . . . , c) be the sent word with wt(e) = t. Then 

n 

score(c) = ^ m,- = €n - t. 

i=l 

Proof: Note that for i = I, . . . ,n, one has that m;^^, = i - wt(e^j, e'). ■ 
If r = wt{e) is "adequate" then we can guarantee that c = (c, . . . , c) is in the output list. In section |V] we 
explain what we mean by "adequate" values for r and we provide bounds for r in order to successfully 
decode this family of codes. 

Remark 2: A decoding algorithm for a repeated code can be obtained decoding every block rj of 
the received word r for / = until the decoded block c'. verifies that (c', ...,c') is at distance 

l(ld - 1)/2J of the received word. This is the approach of |[7l for quasi-cyclic codes, that is when C is 
cyclic. One could also consider as a convolutional code [3]. However, all this approaches will have 
bad performance due to the poor parameters of the repeated codes. 
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Fig. 2. List Decoding [93, 15,79] RS code over F? 
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IV. Computer experiments 

We have implemented in MAGMA the decoding algorithm described in section Hill given a random 
codeword c in a \n,k,d\ Reed-Solomon code we consider the repeated codeword (c, ...,c) e C^. In 
particular we choose £ = 3. We consider t errors in r uniformly distributed random positions among 
the £n positions, so it is possible that more than one error may occur in position /. We apply quadratic 
interpolation decoding (list size 2) using the algorithm in [|8l with the prescribed multiplicities as in section 

imi 

We ran the procedure 1000 times each for several values of t. We compare the performance with the 
performance of the list-decoding algorithm [|T1 applied to a Reed-Solomon code with parameters [£n, k, id\. 
We considered the same values for t and compared the results with our new approach. Since both codes 
have the same information rate this comparative is fair, however, it is not favourable to us since the base 
field of the repeated code is smaller. 

1) The block code is a RS code with parameters [31, 15, 17] over Fj so the repeated code with € = 3 
has parameters [93, 15,51]. With a classical decoding algorithm one can decode at most 25 errors. 
Our simulations in Fig [T] show that we can uniquely decode about 38 errors. We compare it with list 
decoding the [93, 15,79] RS code over F? using multiplicity three and list size 2. We can decode 
about the same amount of errors but it takes 12 times as long as our approach, see Fig |2l 

2) The block code is a RS code with parameters [63, 14,50] over Fe so the repeated code with i = 3 
has parameters [189, 14, 150]. With a classical decoding algorithm one can decode at most 74 errors. 
Our simulations in Fig |3] show that we can uniquely decode about 113 errors. We compare it with 
list decoding the [189, 14, 176] RS code over Fs using multiplicity three, but we can not decode 
113 errors using list size 2 as before. We need list size at least 5, which increases the computational 
demand significantly. We can decode about the same amount of errors but it takes about 118 as 
long as our approach, see Fig HI 

3) The block code is a RS code with parameters [63,40,24] over Fg so the repeated code with i = 3 
has parameters [189,40,72]. With a classical decoding algorithm one can decode at most 35 errors. 
Our simulations in Fig |5] show that we can uniquely decode about 63 errors. We compare it with list 
decoding the [189,40, 150] RS code over Fg using multiplicity three and list size 2. It can decode 
slightly more errors than the repeated code, but it takes about 6 times as long, see Fig |6l 

4) The block code is a RS code with parameters [63, 54, 10] over Fg so the repeated code with i = 3 has 
parameters [189, 54, 30]. With a classical decoding algorithm one can decode at most 14 errors. Our 
simulations show that we can uniquely decode about 30 errors. We compare it with list decoding the 
[189,54, 136] RS code over Fg using multiplicity three and list size 2. It can decode significantly 
more errors, but it takes about 4 times as long for decoding a single vector. 

These experiments clearly show advantages to using a repeated code and our method for decoding 
as compared to using a low rate Reed-Solomon code. The decoding complexity is much less and the 



Fig. 3. List Decoding [189, 14, 150] repeated code over ¥(, with constituent code [63, 14,50] RS code 
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Fig. 4. List Decoding [189, 14, 176] RS code over Fg 
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correction capability is quite similar, sometimes better for the repeated code. 

V. Bounds for the correction capability 

Let C be a Reed-Solomon code with parameters [n, k, d\. Let c = (c, . . . , c) G C'^ be the sent codeword 
and r = c + e the received one, with error weight r = w?(e). We will assume from now on that if two 
errors occurs at position i and blocks j, k then rj rj". We will omit some details because of the space 
constraint. 

For i = I, . . . ,n we denote by r, the number of blocks containing an error at position i. 

From section |ll] we know that we essentially need a bivariate polynomial passing through the points 
with multiplicities m,y3, where the multiplicities are those described in section [nil Obviously m,;^ depends 
on the errors t,. Assume that r = 21Li '^i errors have occurred. Let us compute how many conditions 
impose a bivariate polynomial Q{x,y) passing through the points pi^js with multiplicities m,;^: For each 
position i = 1, . . .,n there are / - r, blocks with no error that impose *^~^'^^2~^'^'^ conditions. Moreover, 
there are r, blocks with an error that impose exactly one condition. Summing along the different positions 
we get: li"=i[T,- + ifilillLZilil]^ expanding this sum and taking on account that 2"Ii = t we get 

n(f + i) ^ Ti + iTii - t] 
T H — 



-E^^^ (3) 



1=1 

equations. 

For a >0 and < b < k - 2 let Pa,b be the set of monomials with {\,k- 1)- weighted degree less than 
or equal to a{k - I) + b. Let M be the multiplicity matrix derived from r. The following two conditions 
guarantee successful decoding. 

(i) There is a polynomial in Pa,b ^ ^m- This is true when the number of monomials within Pa,b is bigger 
than the number of conditions imposed by M, the number of monomials in Pa,b is 

{k - I) + 2{k - I) + ■ • ■ + a{k - \) + {a + \){b + 1) = ^^^^ ^\ k - 1) + (a + \){b + 1). (4) 



2 



Therefore, comparing ([3]) with (H)) the condition we get is: 



a(a+\) n(f + €) ^Ti + lTii-rj 
(k - l)-^— ^ + (b+ \)(a + 1) > T + ^ - ^ ^ '- + 1. (5) 

(=1 

(ii) Any polynomial in Pajy must have y - /i, as a factor. From [[8l Proposition 1] this occurs if the score 
of y - he is bigger than the {l,k - l)-weighted degree of Q(x,y): 



in-T > a(k - I) + b. 



(6) 



Fig. 5. List Decoding [189,40,72] repeated code over Fg with constituent code [63,40.24] RS code 



T 


55 


60 


63 


65 


70 


72 


75 


NumherOfS uccess 
1000 


1 


1 


1 


.999 


.972 


.929 


.810 


Time (seconds) 


127.630 


129.750 


130.280 


127.160 


127.350 


128.930 


126.060 


Fig. 6. List Decoding [189,40, 150] RS code over Fg 
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We want to provide an upper bound for r such that conditions ([5]) and Q are both satisfied. We use 
the following strategy. Since we want to guarantee condition Q we impose 

in-T-\ =a(,k-\) + b. (7) 

Now we have to check when condition ^ is true taking into account (|7]). So we substitute b = £n - t - 
1 - a(k - 1) in ([5]) obtaining, 

{a + 2)T < - ^ ^ ' + J] ^ '- -1-ik- 1 )-^^ + Ma + 1 ). (8) 

i=l 

The latter formula depends on the number of errors r, in each block, which we do not know. We can not 
go further without making any assumption about the r,. We divide the problem into two cases ^ < 2 
and ^ > 2. In each case we will consider different assumptions about r,. 
(A) Assume that T = ^ < 2. 

In particular, the rate ^ of the block code is greater than |. Thus, the dimension is relatively large and the 
minimum distance is small so we expect to decode a small number of errors. It is therefore reasonable 
to assume r, < 1 for every i. Since < r,- < 1 we have that = r, (r, is either zero or one), therefore 

Z'U ''^^f'^' = Z"=i ''^^f''' = Z"=i -r/ = t£. The last equality is because by definition Z'U = r. 



Hence substituting 2"=i ^'^^^2^ ^' by t£ in dH) and collecting every r in the left hand side we get: 

^ ^ .^o • (9) 

a - I + 2 

This bound still depends on a which itself depends on r. Notice that since b = ln-T-\ - a{k - 1) 
then a = ^"'J'"' - ^ with b < k - 2, then we have that 



a < 



k-l ■ 

fn(a+^-^)-(k-l)^-l 



We would like to substitute a by j^- in One can check that the right hand side H(a) = ^^^^^^ 
is a decreasing function in a. Thus, we can replace a by obtaining the next bound for t. 

Theorem 3: If t, < 1 V?, ^ > 3 and a > I - 2 then we can decode at least r errors for r < 
^ni^^k-^yj^^-^ ^here A = f^. 

The previous bound is good enough but doing one more assumption we can can obtain a compact and 
beautiful bound. 

Corollary 4: li a < £ then we can decode r < J^^^ - ^ errors. 

Since we have assumed a <l one might ask when does it hold. We provide a partial answer to it. 



Lemma 5: If r > ^ and 7^- < 2 then a<€ 
(B) Assume that T = ^ > 2. 

The inverse of F, is roughly the rate of the block code, so this case corresponds with codes with 
minimum distance larger than dimension, so we expect to decode a relatively large number of errors. 
Therefore, we will assume that r,- > - 2 V? which implies t >{£ - 2)n. If we replace r,- by (-£ - 2) in ([8]) 
we get, 

-n(^ + 3)-l ,, a(a + l) , , 

T < (k- 1) h cn. 

(a + 2) 2(a + 2) 

Unfortunately the right hand side in the previous inequality, J (a) = - (k - i)^^ + in, it is 

neither an increasing nor a decreasing function in a. Hence we make further assumptions to obtain the 
following results. 

Theorem 6: If Ti> t - 2, 7(a) is an increasing function in a and a> € then we can decode at least r 
errors where 

-n(^ + 3)-l ^, ,Mi+l) , 

T< — (k-l)— -+£n. (10) 

(^ + 2) ^ ^2(^ + 2) ^ ^ 

Theorem 7: If Ti>£- 2, 7(a) is a decreasing function in a and a <2i then we can decode at least t 

errors where , ^ , ^ . ^ . , 

„(,,3)-l 2^2^.1) 

~ (2^ + 2) ^ 2(2^ + 2) ^ ^ 

Remark 8: Along this section we did several assumptions, so it seems that the result are a bit weak. 
But if one compute these bounds for the examples in previous sections turns out to be quite tight. 

. For the [31, 15, 17] RS code and £ = 3 Theorem [6] tell us that we can decode at least 38 errors which 

coincide with the computational simulations. 
. For the [63, 14,50] RS code and £ = 3 Theorem |7] tell us that we can decode at least 107 errors 

which which is fairly good. 
. For the [63,40,24] RS code and ^ = 3 we can Corollary |4] tell us that we can decode at least 71 

which is fairly good but if we use Theorem |4l it tell us that we can decode at least 65 errors which 

coincide with the computational simulations. 
So we claim that we can use our bounds as a good estimation of the real decoding capacity, indepen- 
dently of the assumptions. 

VI. Conclusion 

An efficient list-decoding algorithm for repeated codes, in particular for Reed-Solomon codes is pre- 
sented. The theoretical and experimental results show that repeated codes with this algorithm have better 
performance than Reed-Solomon codes and that they could be used in practice when a code with low rate 
is needed. 
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